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Claims 

1 . (Original) In a computer system, a method of processing images in a sequence of 
video images, the method comprising: 

determining a fraction for a current image in the sequence, wherein the fraction 
represents an estimated temporal distance position for the current image relative to an interval 
between a first reference image for the current image and a second reference image for the 
current image; and 

processing the fraction along with a motion vector for the first reference image, wherein 
the motion vector represents motion in the first reference image relative to a second reference 
image for the current image, and wherein the processing the fraction along with the motion 
vector results in a representation of motion in the current image relative to the first reference 
image. 

2. (Original) The method of claim 1 wherein the fraction is represented by a variable 
length code in a bit stream. 

3. (Original) The method of claim 1 wherein the fraction is selected from a set of 
discrete values, wherein the values are greater than zero and less than one. 

4. (Original) The method of claim 1 wherein the fraction is selected from the group 
consisting of: 1/2, 1/3, 2/3, 1/4, 3/4, 1/5, 2/5, 3/5, 4/5, 1/6, 5/6, 1/7, 2/7, and 3/7. 

* 

5. (Original) The method of claim 1 wherein the estimated temporal position for the 
current image relative to the interval between the first reference image for the current image and 
the second reference image for the current image is not the true temporal position of the current 
image. 

6. (Original) The method of claim 1 wherein the fraction is based on motion information 
for the sequence of video images. 
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7. (Original) The method of claim 1 wherein the fraction is based on a proximity of the 
current image to an end of the sequence of video images. 

8. (Original) The method of claim 1 wherein the representation of motion in the current 
image comprises representation of variable velocity motion. 

9. (Original) The method of claim 1 wherein the determining the fraction comprises: 
evaluating a set of plural fractions to determine bit costs for encoding the current image 

using the plural fractions; and 

V 

selecting a fraction from the set of plural fractions based on the evaluating. 

10. (Original) The method of claim 1 further comprising repeating the acts of claim 1 
for plural images in the sequence of video images. 

1 1 . (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 1 during video encoding. 

12. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 1 during video decoding. 

13. (Original) In a computer system, a method of processing images in a sequence of 

« 

video images, the method comprising: 

determining a fraction for a region of a current image in the sequence, wherein the 
fraction represents an estimated temporal distance position for the current image relative to an 
interval between a first reference image for the current image and a second reference image for 
the current image; and 

processing the fraction along with a motion vector for the first reference image, wherein 
the motion vector represents motion in the first reference image relative to the second reference 
image, and wherein the processing the fraction along with the motion vector results in a 
representation of motion in the region of the current image. 
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14. (Original) The method of claim 13 wherein the region is a slice of the current image. 

15. (Original) The method of claim 1 3 wherein the region is a macroblock of the current 

image. 

16. (Original) The method of claim 13 further comprising repeating the acts of claim 13 
for plural regions in the current image. 

17. (Original) The method of claim 16 wherein a fraction for a first region in the current 
image differs from a fraction for a second region in the current image. 

18. (Original) In a computer system, a method of encoding images in a sequence of 
video images, the method comprising: 

determining a fraction for a current image in the sequence, wherein the current image has 
a previous reference image and a future reference image, and wherein the fraction represents a 
temporal position for the current image relative to its reference images; 

selecting direct mode prediction for a current macroblock in the current image; 

finding a motion vector for a co-located macroblock in the future reference image; 

scaling the motion vector for the co-located macroblock using the fraction. 

19. (Original) The method of claim 18 wherein the fraction facilitates representation of 
variable velocity motion in the direct mode prediction. 

20. (Original) The method of claim 18 wherein the scaling the motion vector for the co- 
located macroblock comprises scaling the vertical component and horizontal components of the 
motion vector for the co-located macroblock. 

2 1 . (Original) The method of claim 1 8 wherein the scaling the motion vector for the co- 
located macroblock comprises: 

scaling the motion vector for the co-located macroblock by a factor of the fraction, to 
obtain an implied forward motion vector for the current macroblock; and 
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scaling the motion vector for the co-located macroblock by a factor of the fraction minus 
one, to obtain an implied backward motion vector for the current macroblock. 

22. (Original) The method of claim 21 further comprising: 

addressing a macroblock in the future reference frame using the implied forward motion 

vector; 

addressing a macroblock in the previous reference frame using the implied backward 
motion vector; and 

predicting the current macroblock using an average of the macroblock in the future 
reference frame and the macroblock in the previous reference frame. 

23. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 1 8 during video encoding. 

24. (Original) In a computer system, a method of processing images in a sequence of 
video images, the method comprising: 

determining a temporal position of a current image in the sequence, wherein the current 
image has plural references, wherein the temporal position is between a first reference image 
based on at least one reference for the current image and a second reference image for the current 
image, and wherein the temporal position is determined independent of time stamps; and 

processing the current image based on the temporal position of the current image and a 
motion vector for the first at least one reference image, wherein the motion vector represents 
motion in the first reference image relative to the second reference image, and wherein the 
processing results in a representation of motion in the current image. 

25. (Original) The method of claim 24 wherein the determining is based on a fixed inter- 
frame distance and a fixed number of images with plural references within the sequence of video 
images. 

26. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 24 during video encoding. 
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27. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 24 during video decoding. 

28. (Original) In a computer system, a method of encoding a current image in a 
sequence of video images, the current image having at least two reference images in the 
sequence, the method comprising: 

analyzing the at least two reference images along with the current image to determine 
whether the current image is to be predictively encoded based on the at least two reference 
images; 

based on the analyzing, encoding the current image independently from the at least two 
reference images; and 

assigning an image type to the current image, wherein the image type indicates that the 
current image is encoded independently from the at least two reference images. 

29. (Original) The method of claim 28 wherein the analyzing comprises analyzing 
motion in the current image. 

30. (Original) The method of claim 28 wherein the image type is encoded as a variable 
length code from a variable length code table. 

3 1 . (Original) The method of claim 30 wherein other variable length codes in the 
variable length code table represent fractions for representing the temporal position of images in 
the sequence of video images. 

32. (Original) The method of claim 30 wherein other variable length codes in the 
variable length code table represent other image types. 

33. (Original) The method of claim 28 further comprising designating the current image 
as a non-reference image, wherein the image type further indicates that the current image is a 
non-reference image. 
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34. (Original) In a video decoder, a method of decoding a current image in an encoded 
video image sequence, the current image having at least two reference images in the video image 
sequence, wherein the decoding yields a decoded video stream, the method comprising: 

receiving an image type for the current image, wherein the image type indicates that the 
current image is encoded independently from the at least two reference images, and 
analyzing bit rate constraints for the decoding; and 

determining whether to omit the current image from the decoded video stream based on 
the analyzing and the image type for the current image. 

35. (Original) The method of claim 34 wherein the video decoder maintains a bias in 
favor of omitting the current image relative to encoded images having different frame types. 

36. (Original) In a computer system, a computer-implemented method of processing 
video images in a video image sequence, the method comprising: 

processing a bit plane for a bi-directionally predicted video image, wherein the bit plane 
comprises binary information signifying whether macroblocks in the bi-directionally predicted 
video image are encoded using direct mode prediction or non-direct mode prediction. 

37. (Original) The method of claim 36 wherein the bit plane is a frame level bit plane. 

38. (Original) The method of claim 36 further comprising, when the binary information 
indicates a non-direct prediction mode for a macroblock, processing a variable length code in a 
variable length coding table, wherein the variable length code represents the non-direct 
prediction mode. 

39. (Original) The method of claim 36 further comprising, when the binary information 
indicates a non-direct prediction mode for a macroblock, processing a variable length code in a 
variable length coding table, wherein the non-direct prediction mode is intra mode and is 
indicated by a combination of the variable length code and a motion vector code, and wherein 
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the variable length code matches a variable length code used to represent a different non-direct 
prediction mode. 

40. (Original) The method of claim 36 further comprising, when the binary information 
indicates a non-direct prediction mode for a macroblock, representing the non-direct prediction 
mode with a variable length code from a variable length coding table comprising plural variable 
length codes, the plural variable length codes representing plural non-direct prediction modes, 
wherein a determination of which of the plural variable length codes represents which of the 
plural non-direct prediction modes is based at least in part on a fraction representing a temporal 
position of the a bi-directionally predicted video image. 

41. (Original) The method of claim 40 wherein the variable length coding table includes 
a code length preference for backward mode. 

42. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 36 during video encoding. 

43. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 36 during video decoding. 

44. (Original) In a computer system, a method of processing images in a sequence of 
video images, the method comprising: 

determining a value representing a forward motion vector component for a macroblock in 
the current image; 

determining a value representing a backward motion vector component for the 
macroblock in the current image; 

adding the value representing the forward motion vector to a forward buffer; 

adding the value representing the backward motion vector to a backward buffer; and 
predicting motion vectors for other macroblocks in the current image using values in the forward 
buffer and values in the backward buffer. 
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45. (Original) The method of claim 44 wherein the predicting comprises finding a 
median of previously decoded neighboring motion vectors. 

46. (Original) The method of claim 44 wherein the values in the forward buffer are used 
to predict motion vectors for forward-predicted macroblocks. 

47. (Original) The method of claim 44 wherein the values in the backward buffer are 
used to predict motion vectors for backward-predicted macroblocks. 

48. (Original) In a computer system, a method of processing an image in a sequence of 
video images, the method comprising: 

for a direct mode predicted macroblock in the image: 

determining a non-zero value representing a forward motion vector component for 
the direct mode predicted macroblock; 

determining a non-zero value representing a backward motion vector component 
for the direct mode predicted macroblock; 

adding the non-zero values to one or more buffers; 

wherein values in the one or more buffers are used to predict motion vectors for 
other macroblocks in the image. 

49. (Original) In a computer system, a method of estimating motion for a bi- 
directionally predicted image in a sequence of video images, wherein the bi-directionally 
predicted image comprises macroblocks, and wherein the bi-directionally predicted image has a 
first reference image and a second reference image, the method comprising: 

selecting a motion vector resolution for the bi-directionally predicted image from among 
plural motion vector resolutions; 

selecting an interpolation filter for the bi-directionally predicted image from among 
plural interpolation filters; and 

encoding the bi-directionally predicted image using the selected motion vector resolution 
and the selected interpolation filter. 
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50. (Original) The method of claim 49 wherein the plural motion vector resolutions 
include a half-pixel resolution and a quarter pixel resolution. 

5 1 . (Original) The method of claim 49 wherein the plural interpolation filters include a 
bicubic interpolation filter and a bilinear interpolation filter. 

52. (Original) The method of claim 49 wherein the bi-directionally predicted image is 
predicted using only one motion Vjector per macroblock in the bi-directionally predicted image. 

53. (Original) The method of claim 52, the method further comprising applying a four- 
motion-vector to one-motion- vector conversion to a macroblock in the first reference image, the 
macroblock in the first reference image having four motion vectors. 

54. (Original) The method of claim 53 wherein the conversion comprises: 
determining a median vertical motion vector component among four vertical motion 

vector components of the four motion vectors of the macroblock in the first reference image; 

determining a median horizontal motion vector component among four horizontal motion 
vector components of the four motion vectors of the macroblock in the first reference image; and 

determining a median motion vector based on the median vertical motion vector 
component and the median horizontal motion vector component. 

55. (Original) In a computer system, a method of predicting motion for a bi- 
directionally predicted image in a sequence of video images, wherein the bi-directionally 
predicted image comprises macroblocks, and wherein the bi-directionally predicted image has a 
first reference image and a second reference image, the method comprising: 

selecting a motion vector resolution for the bi-directionally predicted image from among 
plural motion vector resolutions, wherein the plural motion vector resolutions include a half- 
pixel resolution and a quarter pixel resolution; 

selecting an interpolation filter for the bi-directionally predicted image from among 
plural interpolation filters, wherein the plural interpolation filters include a bicubic interpolation 
filter and a bilinear interpolation filter; and 
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encoding the bi-directionally predicted image using the selected motion vector resolution 
and the selected interpolation filter. 

56. (Original) In a computer system, a method of predicting motion for a bi- 
directionally predicted image in a sequence of video images, wherein the bi-directionally 
predicted image comprises macroblocks, and wherein the bi-directionally predicted image has a 
first reference image and a second reference image, the method comprising: 

selecting a motion vector mode for the bi-directionally predicted image from a set of 
plural motion vector modes, wherein the set of plural motion vector modes includes: 

a one motion vector, quarter-pixel resolution, bicubic interpolation filter mode; 
a one motion vector, half-pixel resolution, bicubic interpolation filter mode; and 
a one motion vector, half-pixel resolution, bilinear interpolation filter mode; and 

encoding the bi-directionally predicted image using the selected motion vector mode. 

57. (Original) The method of claim 56 wherein the selecting is based on an efficiency 
evaluation of encoding the bi-directionally predicted image using one or more of the plural 
motion vector modes. 

58. (Original) In a computer system, a method of processing images in a video image 
sequence to yield a processed video image sequence, wherein the processing is performed at a 
constrained bit rate, the method comprising: 

monitoring bits used during the processing; 

based on the monitoring, determining whether to omit a current image having two 
reference images from the processed video image sequence, wherein the current image has a 
number of bits required to process the current image; 

wherein, at the time of the determining, a number of bits available for use in the 
processing is greater than or equal to the number of bits required to process the current image. 

59. (Original) The method of claim 58 wherein the determining is further based on a 
comparison of the number of bits required to process the current image with a number of bits 
available for use prior to processing the current image. 
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60. (Original) The method of claim 58 wherein the monitoring comprises comparing the 
bits used during the processing with a threshold number of bits. 

61 . (Original) The method of claim 60 wherein the threshold is adaptively adjusted 
during the processing. 

62. (Original) The method of claim 58 wherein the current image is a non-reference 

image. 

63. (Original) The method of claim 58 wherein a bit stream syntax for the video image 
sequence includes plural levels, and wherein the determining is performed at frame level. 

64. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 58 during video encoding. 

65. (Original) A computer readable medium storing computer executable instructions 
for causing the computer system to perform the method of claim 58 during video decoding. 

66. (Original) In a computer system, a method of processing images in a video image 
sequence to yield a processed video image sequence, wherein the images in the video image 
sequence comprise images having two reference images, wherein the processing is performed at 
a constrained bit rate, and wherein images in the video image sequence are operable to be 
omitted from the processed video image sequence based on the constrained bit rate, the method 
comprising: 

determining whether to omit a current image having two reference images from the 
processed video image sequence, wherein the current image has a number of bits required to 
process the current image, and wherein the determining comprises: 

if more than half of n images processed prior to the current image were omitted 
from the processed video image sequence, then omitting the current image from the 
processed video sequence if the number of bits required to process the current image is 



Page 12 of 16 



BCF/KBR:amcl 02/05/07 634884 303514.1 Attorney Reference Number 3382-64472-01 

PATENT Application Number 1 0/622,378 

greater than the average bits per image used to process the n images processed prior to 
the current image; and 

if half or less than half of the n images processed prior to the current image were 
omitted from the processed video image sequence, then omitting the current image from 
the processed video sequence if the number of bits required to process the current image 
is greater than the twice the average bits per image used to process the n images 
processed prior to the current image. 

67. (Original) The method of claim 66 wherein the determining is based in part on 
image type statistics for images in the video image sequence. 

68. (Original) A method of processing a video image sequence, wherein the processing 
yields an encoded video image sequence in a bit stream having plural bit stream levels, wherein 
the plural bit stream levels include a frame level, and wherein the video image sequence 
comprises predicted images, the method comprising: 

omitting a predicted image in the video image sequence from the encoded video image 
sequence; and 

representing the omitted predicted image with a frame-level indicator in the bit stream; 
wherein the frame-level indicator is operable to indicate the omitted predicted image to a 
video decoder. 

69. (Original) The method of claim 68 wherein the frame-level indicator is data having 
an indicator size, and wherein the indicator size indicates the omitted predicted image. 

70. (Original) The method of claim 69 wherein encoded images in the encoded video 
image sequence have a minimum encoded image size, and wherein the indicator size is smaller 
than the minimum encoded image size. 

71 . (Original) The method of claim 69 wherein the indicator size is one byte. 
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72. (Original) The method of claim 68 wherein the frame-level indicator causes the 
video decoder to choose one or more reference images for a predicted image. 

73. (Original) The method of claim 72 wherein the predicted image is a bi-directionally 
predicted image. 
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